Object Localization by Joint Audio-Video Signal Processing
نویسندگان
چکیده
There are many di erent approaches either for the localization of sound sources or for tracking of visible objects in image sequences. However, most applications use only one modality, that is, they process only audio or video information for object localization. In this paper we introduce a method for the estimation of object positions based on joint audio-video information. The key technique is a modi ed decentralized Kalman lter (MDKF), where the object localization problem is viewed as state estimation. First, the position of the object is estimated based on audio and video information separately. Then, the locally estimated results are further processed in a decentralized Kalman lter for data fusion. At the output we obtain the joint estimation results. Experiments have shown that the joint estimation provides more correct localization results than obtained by using audio or video information only.
منابع مشابه
Joint audio-video object localization using a recursive multi-state multi-sensor estimator
Object localization based on audio and video information is important for the analysis of dynamic scenes such as video conferences or traffic situations. In this paper, we view the the dynamic audiovideo object localization problem as a joint recursive estimation problem. It is solved using a decentralized Kalman filter fusing both audio and video position estimates. To better take into account...
متن کاملLearning Joint Statistical Models for Audio-Visual Fusion and Segregation
People can understand complex auditory and visual information, often using one to disambiguate the other. Automated analysis, even at a lowlevel, faces severe challenges, including the lack of accurate statistical models for the signals, and their high-dimensionality and varied sampling rates. Previous approaches [6] assumed simple parametric models for the joint distribution which, while tract...
متن کاملJoint audio-video object tracking
This paper presents a object localization and tracking algorithm integrating audio and video based object localization results. A face tracking algorithm and a microphone array are used to compute two single-modality speaker position estimates. These position estimates are then combined into a global position estimate using a decentralized Kalman filter. Experiments with a model railway show th...
متن کاملEfficiency of Target Location Scenarios in the Multi-Transmitter Multi-Receiver Passive Radar
Multi-transmitter multi-receiver passive radar, which locates target in the surveillance area by the reflected signals of the available opportunistic transmitter from the target, is of interest in many applications. In this paper, we investigate different signal processing scenarios in multi-transmitter multi-receiver passive radar. These scenarios include decentralized processing of reference ...
متن کاملInteractive Controller for Audio Object Localization and Automatic Thumbnail Music Generator
In this demonstration, we provide a new interactive controller for audio object localization and an automatic thumbnail music generator for an arbitrary stereo mixed source. Those two products are based on the same audio coding framework enabling the extraction and localization operation of audio objects (e.g., vocal, guitar, drums) via the temporal quantization of spatial information [1]. In t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000